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DETAILED ACTION 

1 . Claims 1 2, 1 8-21 , 29 and 34-44 are pending. Claims 34 and 40-42 are 
withdrawn. Claims 12, 18-21, 29, 35-39, 43 and 44 are examined. 

Continued Examination Under 37 CFR 1.114 

2. A request for continued examination under 37 CFR 1 . 1 1 4, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
9/8/2004 has been entered. 

Election/Restrictions 

3. The Requirement for Election/Restrictions, as set forth and considered in Office 
actions mailed 1 0/1 0/2001 , 3/27/2002, 1 0/1 7/2002 and 3/26/2003; and applicant's 
election of Group I (originally claim 12), without traverse, is maintained. Reservations 
regarding applicant's election of species, as discussed in the Office action, mailed 
3/26/2003, are noted for the record. 

4. Claims 34 and 40-42 remain withdrawn from further consideration as being 
drawn to non-elected species, there being no allowable generic or linking claim. 

Withdrawn Claim Rejections 

5. The rejection of claim 38 under 35 U.S.C. 112, first paragraph, for new matter, is 
withdrawn in view of applicant's amendment to the claim, filed 9/8/2004. 
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Maintained Claim Rejections 
Maintained Claim Rejections - 35 USC § 102/103 

The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

6. Claims 12, 18-21 , 29, 35-39, 43 and 44 are rejected under 35 U.S.C. 102(b) as 
anticipated by or, in the alternative, under 35 U.S.C. 103(a) as obvious over Palfreyman 
et al., US 4,421,767. This rejection maintains the reasons of record, as set forth in the 
previous Office actions, mailed 1 1/12/2003 and 3/26/2003. The rejection of claim 44 is 
necessitated by applicant's addition of claim 44, filed 9/8/2004. 

Response to Arguments 

Applicant's arguments filed 9/8//2004 have been fully considered but they are not 
persuasive. Applicant argues that the reference of Palfreyman et al., US 4,421,767, 
does not disclose a combinatorial library, as defined in the instant application. Applicant 
argues that Palfreyman does not disclose a collection of compounds as instantly 
claimed, but instead discloses individual compounds represented as a Markush group. 
Applicant state that the compounds in the claimed library are related by location and by 
preparation. Applicant states that individual compounds disclosed as formula I, II or III 
in Palfreyman, does not disclose a collection as claimed in instant claim 12. 

Applicant further argues that Palfreyman does not provide any teaching or 
suggestion to do that which applicant has done. Applicant argues that Palfreyman 
discloses individual compounds by formula I, II or III in a Markush format, and individual 
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compounds are not the same as a collection of compounds or a combinatorial library, as 
claimed. 

Applicant further states that "the combinatorial library of the instant claims 
encompasses mixtures of compounds and a collection of individual, pure 
compounds prepared by the process recited in the claim. The claim is not directed to 
individual compounds." Applicant argues that the instant claim recites the feature that 
the plurality of compounds in the combinatorial library are prepared by the process 
recited in the claim. Again, applicant states that the library can be made as a mixture of 
compounds, or as a collection of individual pure compounds, depending on the methods 
used for identification of active compounds. Applicant states "[w]hen the library is made 
as a mixture of compounds, the compounds in the library are related by location as 
members of a common mixture having unique chemical and/or physical characteristics. 
When the library is made as a collection of individual pure compounds, the compounds 
are related by location as individual compounds distributed in a combinatorial array." 
Applicant states that the amendment to claim 12, now recites the limitation that plurality 
of compounds in the combinatorial library are prepared by the process recited in the 
claim. 

Applicant's argument have been considered, but are not deemed persuasive. 
The examiner respectfully submits that the patent reference of Palfreyman discloses a 
collection of individual pure compounds. These compounds of Palfreyman are related 
in that they share a common core structure that confers a common property, as 
demonstrated by the common purpose to which they may be put, i.e., treatment of 
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mental depression. The reference of Palfreyman discloses anticipatory compounds I, II 
and III together, and contemplates treating patients with any one of compounds I, II or 
III. The examiner respectfully submits that, as such, Palfreyman discloses compounds 
I, II and III as an ensemble of molecules, sharing a method of synthesis, molecular core 
structure, properties, and purpose. Even although applicant argues that Palfreyman 
does not teach a "library" of compounds; however, the term is used quite broadly in the 
art to mean "any ensemble of molecules" (e.g., see Janda, K.D. "Tagged versus 
untagged libraries: Methods for the generation and screening of combinatorial chemical 
libraries", Proc. Nat'l. Acad. Sci., USA, November 1994, 91, 10779-10785, especially p. 
10779, column 1, last sentence, "In its purest form, a combinatorial chemical library can 
be defined as any ensemble of molecules"). Therefore the examiner respectfully 
submits that the reference of Palfreyman et al. disclose a combinatorial library of 
compounds I, II and III. 

Applicant's traversal of the instant rejection, as understood by the examiner, 
appears to argue that the compounds of a library must be related by location, as either 
in a mixture, or distributed in an array. These limitations are not required in the 
description of the term "combinatorial library", found in the specification as filed at p. 7, 
lines 14-36, although the specification teaches such mixtures or arrays may be 
preferred. For example, the specification, at p. 7, state that "it may be preferred to 
provide the library as a large mixture of compounds", or that "[l]arge combinatorial 
libraries may also be prepared by massively parallel synthesis of individual compounds, 
in which case compounds are typically identified by their position within an array." 
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Therefore, the examiner respectfully submits that these limitations are not found in the 
claims, and are precatory in the specification's description of combinatorial libraries. In 
response to applicant's argument that the references fail to show certain features of 
applicant's invention, it is noted that the features upon which applicant relies (i.e., 
libraries must have compounds as mixtures or arrays) are not recited in the rejected 
claim(s). Although the claims are interpreted in light of the specification, limitations from 
the specification are not read into the claims. See In re Van Geuns, 988 F.2d 1 181, 26 
USPQ2d 1057 (Fed. Cir. 1993). 

Furthermore, it would have been obvious for one of ordinary skill in the art to 
have combined the compounds taught by Palfreyman, and thereby form a combinatorial 
library, because said compounds are used for a common purpose, i.e., the treatment of 
mental depression. See In re Kerhoven . 626 F.2d 846, 850, 205 USPQ 1069, 1072 
(CCPA 1980) (stating: "It is prima facie obvious to combine two compositions each of 
which is taught by the prior art to be useful for the same purpose, in order to form a third 
composition to be used for the very same purpose.... [T]he idea of combining them 
flows logically from their having been individually taught in the prior art" [citations 
omitted]); and MPEP 2144.06. 

New Claim Rejections 
New Claim Rejections - 35 (JSC §112 

The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 
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7. Claims 12, 18-21 , 29, 35-39, 43 and 44 are rejected under 35 U.S.C. 112, 
second paragraph, as being indefinite for failing to particularly point out and distinctly 
claim the subject matter which applicant regards as the invention. 

Claims 12 and 44 recite the limitation "the compound 1 ' in the last line. There is 
uncertain antecedent basis for this limitation in the claims. 

New Claim Rejections - 35 USC § 102/103 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action; 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

This application currently names joint inventors. In considering patentability of 
the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of 
the various claims was commonly owned at the time any inventions covered therein 
were made absent any evidence to the contrary. Applicant is advised of the obligation 
under 37 CFR 1 .56 to point out the inventor and invention dates of each claim that was 
not commonly owned at the time a later invention was made in order for the examiner to 
consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g) 
prior art under 35 U.S.C. 103(a). 

8. Claims 12, 18-21 , 29, 35-39, 43 and 44 are rejected under 35 U.S.C. 102(b) as 
anticipated by or, in the alternative, under 35 U.S.C. 103(a) as obvious over Kick et al., 
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Kick et al. disclose collections of compounds (reading on the claimed 
"combinatorial library") that read directly on the instant formula 1. See, compounds 7a, 
7b and 7c in Kick et al., (p. 1428, para 3 and Figure 2). The compounds of the 
reference read on the claims where in R 1 is hydrogen, R 2 is carboxamido, R 3 and R 4 are 
independently hydrogen, and R 5 is heteroaryl. 

In the alternative, although Kick et al., does not disclose that their collections of 
compounds (i.e. libraries) are made by the claimed process, the products of Kick et al. 
would appear to be the same as those recited by the instant claim, regardless of their 
method of manufacture. See MPEP 21 13. Also note that many of the instant claims 
recite only process limitations and thus are non-limiting on the product. 

The collection of compounds of Kick et al. (reading on the claimed "combinatorial 
library") meet all of the limitations of the claimed library except for the product-by- 
process limitations and would either anticipate or render obvious the claimed library. 
"[E]ven though product-by-process claims are limited by and defined by the process, 
determination of patentability is based on the product itself. The patentability of a 
product does not depend on its method of production. If the product in the product-by- 
process claim is the same as or obvious from a product of the prior art, the claim is 
unpatentable even though the prior product was made by a different process." In re 
Thorpe . 777 F.2d 695, 698, 227 USPQ 964, 966 (Fed. Cir. 1985). 

The examiner respectfully submits that because the claim is drawn to a 
"combinatorial library", but this product is defined as a product-by-process, that any 
collection of compounds meeting the structural requirements of the instant claims reads 
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on this product. The process by which the claimed library is made does not appear to 
lend patentable weight to the claimed invention. One of ordinary skill would expect the 
library to be the same no matter how it was synthesized. 

9. Claims 12, 18-21 , 29, 35-39, 43 and 44 are rejected under 35 U.S.C. 102(b) as 
anticipated by or, in the alternative, under 35 U.S.C. 103(a) as obvious over Gordon et 
al., Bioorganic & Medicinal Chemistry Letters, 1995, Vol. 5, No. 1, pp. 47-50. 

Gordon et al. disclose collections of compounds (reading on the claimed 
"combinatorial library") that read directly on alpha-amino acid derivatives of the formula 
depicted in claim 37. See, the compounds of the Tables found on p. 48 of Gordon and 
Scheme 2 on p. 49, and the text on said pages. Gordon et al. teach ten different alpha 
amino acids, which are then attached to resin bead, and mixed, providing a 
combinatorial library of alpha amino acids. These compounds of the reference read on 
the claims where the combinatorial library contains alpha amino acids. 

In the alternative, although Gordon et al., does not disclose that their collections 
of compounds (i.e. libraries) are made by the claimed process, the products of Gordon 
et al. would appear to be the same as those recited by the instant claim, regardless of 
their method of manufacture. See MPEP 21 1 3. Also note that many of the instant 
claims recite only process limitations and thus are non-limiting on the product. 

The collection of compounds of Gordon et al. (reading on the claimed 
"combinatorial library") meet all of the limitations of the claimed library except for the 
product-by-process limitations and would either anticipate or render obvious the claimed 
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library. "[E]ven though product-by-process claims are limited by and defined by the 
process, determination of patentability is based on the product itself. The patentability 
of a product does not depend on its method of production. If the product in the product- 
by-process claim is the same as or obvious from a product of the prior art, the claim is 
unpatentable even though the prior product was made by a different process." In re 
Thorpe . 777 F.2d 695, 698, 227 USPQ 964, 966 (Fed. Cir. 1985). 

The examiner respectfully submits that because the claim is drawn to a 
"combinatorial library", but this product is defined as a product-by-process, that any 
collection of compounds meeting the structural requirements of the instant claims reads 
on this product. The process by which the claimed library is made does not appear to 
lend patentable weight to the claimed invention. One of ordinary skill would expect the 
library to be the same no matter how it was synthesized. 

Conclusion 

1 0. Claims 12, 18-21, 29, 35-39, 43 and 44 are rejected, Claims 34 and 40-42 are 
withdrawn. 

1 1 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Mark L. Shibuya whose telephone number is (571) 272- 
0806. The examiner can normally be reached on M-F, 8:30AM-5:00PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Andrew Wang can be reached on (571 ) 272-081 1 . The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



Mark L. Shibuya 
Examiner 
Art Unit 1639 
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Tagged versus untagged libraries: Methods for the generation and 
screening of combinatorial chemical libraries 

Kim D. Janda 

Departments of Molecular Biology and Chemistry, The Scripps Research Institute, 10666 North Torrey Pines Road, La Jolia. CA 92007 



ABSTRACT Over the past two dec- 
ades the pharmaceutical industry has been 
driven by the biological sciences. The dis- 
covery and description of the biological 
mechanisms that underlie disease states 
accompanied by an. unraveling of these 
mechanisms has provided drug, and more 
recently biotechnological, companies with 
a barrage of new therapeutic targets. Par- 
adoxically, as a result of such biological 
and biochemical advances, new sources of 
drug leads are in short supply. Consider- 
able effort in trying to create potential 
drug candidates has led to the parturition 
of combinatorial chemical libraries* In this 
review. I will . examine some^ of the main 
technologies for generating and deducing 
active components from combinatorial li* 
brartes that have been segregated into two 
schools of thought: (0 the creation and 
decoding of combinatorial libraries by so- 
called tagged methodologies, and (it) the 
production and deconvolution of chemical 
libraries by untagged protocols. 

The random screening of natural prod* 
ucts from microbial fermentations, plant 
extracts, and marine sources for possible 
activity as therapeutic agents has been a 
rich source of new drug discoveries for 
years in the pharmaceutical sciences (1). 
However, with the advent of molecular 
biology and progress in crystallography 
and computational chemistry, "'rational 
drug design" has found many advocates. 
The knowledge of the three-dimensional 
structure of receptors or enzymes and the 
manipulation of this information has lead 
to the development of a number of drug 
candidates including angiotensin con- 
verting enzyme (2, 3) and renin inhibitors 
(4-6). But even such " rational" success 
stories, initiated from first principles and 
based solely on structural information 
about the macromolecule alone, have not 
proved as fast or reliable as once claimed . 
In recent years there has been a renais- 
sance in drug screening with an ingress of 
new technologies based on combinatorial 
chemical libraries (for review, see ref, 7), 
These methods expose many compounds 
to a target and allow the compounds that 
bind the target with the highest affinity to 
be filtered out from a pool of statistical 
, sequences. In its purest form, a combi- 
natorial chemical library can be defined 



as any ensemble of molecules, whereas Mimotope Strategy 
the production and ultimately the meth- 
ods of screening these combinatorial 
chemical libraries determine the 4 'hit 
rate" — i.e., the success or failure of 
these collections of molecules (8). 



Untagged Approaches to Generating 
Combinatorial Chemical Libraries 

The two general procedural methodolo- 
gies in formulating untagged combinato- 
rial libraries arc "mixture synthesis" and 
"portioning-mixing." Mixture synthesis 
is exactly what it sounds like, in that 
mixtures of chemical units are coupled to 
an activated support to produce chemical 
diversity (9-12). The advantage of such 
an approach is that combinatorial librar- 
ies of vast complexity, in theory, should 
be accessible. The problem of achieving 
such diversity is that product dispersal is 
strongly influenced by the relative kinetic 
rates of each competing chemical unit 
being coupled. Deviating from reactions 
where the kinetic constants for the addi- 
tion of each individual component are 
unknown could be disastrous to the prod- 
uct distribution. Portioning-mixing (13- 
15), or what has come to be known as 
combinatorial library split synthesis, is a 
two-step operation based on a divide- 
couple and rccombine procedure. The 
essentials of this strategy are that a poly- 
meric support is divided into equal por- 
tions for coupling to modular units [such 
units have typically been amino acids; 
however, in principle, any chemical moi- 
ety (say X) could be appended to the 
support]. The matrices are combined in a 
single vessel for washing and/or depro* 
tection and then divided again for the 
next coupling. Repeating this protocol 
for a total of n cycles can produce a 
stochastic collection of up to X n different 
molecules. More important, this strategy 
allows for an equal distribution of the 
coupled chemical units and for uniform 
couplings to occur. 

Various approaches for generating and 
screening untagged combinatorial chem- 
ical libraries have been developed. Each 
has advantages and disadvantages in its 
efforts to create chemical diversity* Sev- 
eral of the most prominent will be con- 
sidered below. 



Geysen and coworkers (11, 16, 17) have 
presented an amide linkage strategy 
wherein peptides can be synthesized in a 
reusable format. The peptides are, syn- 
thesized on polymer matrices, "pins," 
positioned within a microtiter 96-wcll 
plate. In this approach, natural L-amino 
acids, as well as unnatural D-amino acids 
(we will term either enantiomer X), can 
be incorporated into the library; thus, X n 
peptides are conceivable. However, to 
manage such large numbers with this pin 
format dipeptidc units are initially held 
constant, whereas the rest of the se- 
quence is formed from randomly incor- 
porated amino acids. To synthesize a 
hexapeptide library in this regime re- 
quires NNA'p^jNN, where now AT [31 
and Xn)* are defined amino acids at po- 
sitions 3 and 4, respectively, and N rep- 
resents positions where residues are ran- 
domly incorporated by using reaction 
mixtures containing natural or unnatural 
amino acids. If a primary set is made 
using an alphabet of 20 amino acids, then 
the size of this set will be 400 (20 x 20 
preparations), each of which is now a 
mixture of peptides theoretically consist- 
ing of 160,000 different peptides. The 
derivatization level will be at MOO nmol 
per pin or an average of **4 x 10 n copies 
of each of the individual peptides. 

Examining this primary set of 400 pep- 
tide mixtures for binding with an antibody 
or receptor allows identification of the 
optimum dipeptide sequence for X[y\X\4), 
A second screen (40 peptide mixtures) 
NNA: U) A: ( 4jD { 5iN and NDi2^ (J j^ l 4]NN, 
wherein the amino acids XpyV^i are fixed 
and D equates to positions where single 
amino acids are to be incorporated, allows 
resolution or absolute identification of ad- 
ditional residues. This cycle of synthesis 
and screening is reiterated until the entire 
hexapeptide sequence is optimized for 
binding to the target of interest. 

Originally this mimotope strategy was 
used to determine discontinuous epitopes 
with a protein antigen (12). Recently, its 
scope has been expanded to include the 
screening of other relevant receptors 

♦Subscripts In brackets throughout text rep- 
resent residue positions rather than multi- 
ples. 
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Fio, 1. Scheme of how peptide combinatorial libraries can be constructed using the 
light-directed, spatially addressable parallel chemical synthetic approach developed by Foder 
et at. (19). 



(18). The positive aspects of the mimo- 
tope scheme are that modular synthetic 
strategies are ultimately used in second- 
ary, tertiary, etc. ... screens. The li- 
brary diversity can also be increased by 
applying a-disubstituted and /3-amino ac- 
ids to the process. 

Light-Directed, Spatially Addressable 
Parallel Synthesis of Combinatorial 
Libraries 

Fodor and his colleagues at Affymax (19) 
have demonstrated a technology that in- 
tertwines solid-phase synthesis with pho- 
tolithography in the preparation of pep- 
tide and oligonucleotide combinatorial 
libraries (Fig. 1)* Their general procedure 
was to derivatize an aliphatic amino- 
terminated matrix with the light-sensitive 
amino-protecting group 3,4-dimethoxy- 
6-nitrobenzyloxycarbonyl (NVOC). Pho- 
tolysis of the surface removes the pro- 
tecting group and thereby activates the 
area for further synthesis. The synthetic 
scaling process that ensues depends on 
the photolithographic masking pattern 
used. Thus, after photolysis (deprotec- 
tion) the entire surface is exposed to the 



next NVOC-protected amino acid or oli- 
gonucleotide, wherein coupling of either 
unit occurs only in the regions exposed to 
light. The procedure is repeated until all 
the building blocks are coupled to the 
support. The pattern of masks and the 
sequence of reactants define the products 
and their locations. In other words, the 
identity of the sequence and its location 
are known. 



Screening for activity within these syn- 
thesized libraries is done via a fluores- 
cent reporter-group assay, wherein a flu- 
orescently tagged receptor or enzyme is 
allowed to interact with the library. The 
fluorescent intensity at each site will de- 
pend on the affinity of the receptor for the 
compound, the concentration of the re- 
ceptor, and the number and density of the 
sites resulting in the binding event. 

The strengths of this approach are that 
high-density arrays of chemical com- 
pounds can be created in a very small 
area. A 10-step binary synthesis with 
amino acids results in the formation of 
1024 peptides in 1.6 cm 2 . Furthermore, 
the chemistry is not limited to just pep- 
tide and oligonucleotide synthetic strat- 
egies. As evidence, the Schuitz group 
(20) has extended the Fodor technology 
to an oligocarbamate library (Fig. 2). 

One-Bead, One-Peptidc Solid-Support 
Technology 

A one-bead, one-peptide combinatorial 
library using, split synthesis was, pio- 
neered in a collaborative effort by work- 
ers at the University of Arizona and the 
Selectide Corporation (21). Their ap- 
proach is defined by a noncleavable 
linker moiety that attaches the peptide 
unit to the bead. Library screening is 
accomplished by using receptors conju- 
gated to fluorescein or an enzyme; either 
entity distinguishes an association event 
between relevant library members and an 
acceptor molecule from the rest of the 
population. Individual library member 
beads that are stained by this process are 
removed and analyzed by Edman micro- 
sequencing to deduce the sequence of the 
corresponding peptide iigand. 

Using this methodology, the Tucson 
group has synthesized a 19-amino acid 
(less cysteine) pentapeptide library 
(2,476,099 members). The library was 
screened against a monoclonal antibody 
known to bind the ^-endorphin epitope 
(YGGFL) with a of 17.5 nM, The 
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antibody 3E7 conjugated to alkaline 
phosphatase retrieved six new pentapep- 
tide-binding sequences with AT d values 
from 15 to 8780 nM. 

The power of this one-bead, one- 
peptide split synthesis concept lies in the 
fact that any solid-phase chemistry can 
theoretically be applied and investigated 
in a receptor-based and/or enzyme as- 
say* However, this advantage also leads 
to drawbacks, oneof which is that librar- 
ies must be screened attached to the 
support. Subsequent work has been di- 
rected at using multicleavable tinker ap- 
pendages to allow for screening of solu- 
ble peptide mixtures (22). 

Dual-Defined, Positional Scanning, and 
Robotics Library Technology 

Houghten et al (23-25), using what they 
termed a "dual-defined iterative" meth- 
odology, have assembled soluble combi- 
natorial peptide libraries via split synthe- 
sis. In the seminal paper (23) a dual- 
defined hexapeptide library containing 18 
amino acids was. constructed as follows:- 
Four cycles using solid-phase peptide syn- 
thesis and '^onioning-mixing'* provided 
104,976 protected tctrapeptide resin se- 
quences (18 4 ). This partial library of 
NNNN-resins (where N is a randomized 
amino acid position) was divided into 324 
aliquots, so that the synthesis of the next 
two positions X [i} X m NNNN-resin could 
be defined (X is an amino acid position 
that is defined; the bracketed subscript 
indicates residue position); upon depro- 
rection and cleavage from these respec- 
tive resins, a now complete soluble 
hexapeptide library of >34 million mem- 
bers was obtained. These 324 pools are 
assayed, and positive results for the first 
two residues (say A (lJ B UJ ) were noted, 
Next, 18 new libraries were synthesized 
with the formula A t i|B U) Ar t3| NNN, one for 
each amino acid at position 3, and tested 
to define X^ The process is repeated 
until all positions are defined. Essentially, 
this methodology is an iterated search 
process that consists of making the library 
in a number of segregated pools, finding 
the active pool that defines the entity for 
the position on the molecule, and then 
repeating the process until the active com- 
ponent has been identified. A simitar iter- 
ative process called SURF (synthetic un- 
randomization of randomized fragments) 
was used by ISIS Pharmaceuticals (Carls- 
'26)' 80 ol ' sonucleotide library 

A virtue of this dual-defined iterative 
technology is that the multiplicity of com- 
ponents decreases with each step, so that 
an enrichment process occurs, and be- 
cause molecules can be assayed in solu- 
tion, it permits functional, as well as 
binding, assays. An application of this 
iterative strategy includes the discovery 
of antimicrobial peptides with nr.rivinV* 
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against Gram-negative and Gram-posi- 
tive bacteria (23, 24). 

The positional scanning format is again 
based on soluble combinatorial libraries; 
however, its proof of concept has been 
shown to be viable using mixture synthe- 
sis (27 , 28) ; now for the same hexapeptide 
library of 18 amino acids, six different 
libraries (i.e., 108 positional scanning 
sublibraries) of the general formula 

^(l)N(2lNt3lN'HjN [3 ,N[6) t N(i)A:(2lN(3]- 

N[4|N ( 5]N t6l , N (ll N (2] Jr ( 3,N H )N m N«s), 
NmN^No^i^NiijN^], N m N m N (3) - 
N(4»*(5]N {61) and NiuN^NpiN^Nij)^] 
must be synthesized and assayed. This 
technology defines the preferred residue 
at each, position of the sequence. The 
technology also alleviates the unwieldy 
iterative synthesis and selection steps 
required in the dual-defined methodol- 
ogy. However, this strategy, unlike the 
dual-defined methodology, is not en- 
dowed with an enrichment process or a 
progressive improvement of the signal- 
to-noise ratio. 

The Chiron group has developed a fully 
automated peptide synthesizer that al- 
lows combinatorial peptide libraries to be 
created in the split-synthesis format (29, 
30). The instrument consists of an array 
of reaction vessels, solenoid valves, and 
a Zymark robot that is computer con- 
trolled. The use of this instrument was 
shown by synthesizing a 361-mcmber 
decapeptide library. Through competi- 
tive ELISA and an affinity-selcction 
methodology decapeptide library mem- 



bers were identified that bound an anti- 
gpl20 monoclonal antibody. 

A Recursive Deconvolution Strategy 

The final "nontagged" combinatorial li- 
brary methodology to be described is that 
of my group (31). The essence of our 
method is to build and hold a set of par- 
tially synthesized combinatorial libraries. 
A formal example of our method is shown 
in Fig. 3 for a library of degree 3, made 
from an alphabet of three components A 
B, and C, In our process we define three 
channels of synthesis, and each involves 
only the addition of a single component. 
Initiating the process requires the making 
of three pools, in which A, B; and C are 
added to a matrix adapted with a linker for 
coupling of the components. A portion of 
this library is set aside and labeled as 
partial library p(l). This portion or frac- 
tion to be saved and catalogued is the 
inverse of the degree of the library in the 
first step. Hence, in the example pre- 
sented* a> degree- of three; one-third of 
each pool is saved and labeled; for sub- 
sequent steps the amount saved and cat- 
alogued is the inverse of degree minus 
one. The remaining material is combined 
and separated into three portions, each 
channel is loaded, and A, B, and C are 
attached as before. Again, an aliquot of 
this library is set aside as partial library 
p(2), which now is three pools made up of 
N[i)A[2], N ai B [2 i, and N (ll C m . The re- 
mainder is again pooled and split, and the 



P(1)A 



0^ 



pd)B 



Save 




2/3 




T7T*~ P<DC 



Couple with: a 



P(2)A 




^ >™ pa,c 



1/2 




Couple with: ^ 



N I») N UI A |)| | 



N iu N ro B »i I 



N <i| N »|C|i|] 



Fig. 3. General scheme of how a chemical combinatorial library using the concent of 
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third step of addition is carried out to give 
the final library N U iN[2iA[ 3 |, N m N [31 B f ». 
and NmNpCw 

Screening against a receptor, ligand, or 
even an enzyme and then determining the 
most active library member(s) is done 
simply by examining the final three pools 
first and proceeding backward to the par- 
tial libraries saved. Thus, for the example 
presented in Fig. 3, wc have three pools, 

NmN(2yY ( 3i is A * B \ or nine com " 
pounds in each pool (i.e., a total of 27 

different compounds), which are tested by 
an appropriate assay, and the active pool 
is determined. Suppose NniN(ajB(jj from 
the final library shown in Fig. 3 is positive. 
We then go back to library p(2) and add B 
lo an aliquot of each of the three pools, 
P(2)A, P(2)B, P(2)C, to give three new 
libraries of the general formula N[iyr t21 - 
Bi3j. These three new libraries now con- 
tain only nine components, so a 3-fold 
enrichment has been achieved. Again, af- 
ter testing, suppose pool N t i]A U ]B m is 
active. Wc proceed to partial library P(l) 
and add A to each followed by B to give 
three new pools with the structure 
X(tjA[;)B[3j, which can be tested to find 
X\i\. Again, a 3-fold enrichment has been 
achieved; the structure is synthesized, 
and the sequence is deduced. 

Wc have examined this recursive de- 
convolution in a peptide combinatorial 
library that was tailored to contain penta- 
peptide sequences that display binding to 
the commercially available anti-^-endor- 
phin monoclonal antibody 3E7. In the 
final analysis, the native epitope YGGFL 
was found to be the most extensive 
binder; however, other weaker binders 
were also deduced through this strategy 
(31). 



identification of library members that in- 
teract with the biological target of interest. 
Although these three points have been 
addressed in several of the "untagged" 
combinatorial library protocols de- 
scribed, limitations do exist. In this next 
section alternative strategies will be pro- 
vided, which are termed "tagged 11 com- 
binatorial libraries. 

Phage Technology 

Arguably, one of the most powerful tag 
technologies is that of strictly biological 
origin. The general concept is one in 
which a library of peptides is presented 
on the surface of a bacteriophage such 
that each phage displays a unique peptide 
and contains within its genome the cor- 
responding DNA sequence (32-34). In 
detail, foreign DNA can be inserted into 
the minor coat protein locus (gene III) of 
filamentous phage to create fusion phage 
that express these corresponding pep- 
tides at the f4 terminus of the absorption 
protein (pill), which is displayed on the 
phage surface (Fig. 4). The diversity of 
displayed peptides is generated by clon- 
ing randomly synthesized oligonucleo- 
tides that are inserted into a specific 
region of gerte III. These phage libraries 
encoding peptide units can possess as 
many as 10 9 unique sequences, which can 
be screened for binding to any type of 
immobilized receptor in a selection pro- 
xess known as, ** panning/' This .method 
uses an affinity capture technique to se- 
lect peptide display phage that bind to the 
receptor of interest. Selected phages are 
amplified by infecting £. co// t whereas 
each cycle of panning and amplification 



enriches certain peptide display phage 
sequences that bind most tightly to the 
receptor molecule. After the panning 
process, DNA from the isolated phage is 
sequenced, and the peptide responsible 
for binding is elucidated. 

The application of phage technology to 
the binding of receptors (antibodies) has 
been demonstrated by several groups 
(35, 36), including Smith's group (34), 
who have shown that an epitope 
hexapeptide library («10 7 members) dis- 
played within the pill protein could pro- 
vide individual members that bound to 
two different antibodies that were specif- 
ically made to the DFEKI peptide unit 
found on the surface of myohemerythrin. 
In a library designed by Devlin et ai (37) 
phage display epitope libraries with 15, 
rather than 6, amino acids were made. 
This approach not only increases the 
effective size of the library, it also allows 
the possibility of the display of discon- 
tinuous epitopes. 

The strengths of phage technology are 
that large combinatorial libraries of pep- 
tide fragments can be generated very 
quickly and efficiently. Of greater impor- 
tance is that these peptide libraries are 
physically linked to their own encoding 
tag (DNA), which allows for user- 
friendly amplification, enrichment, and 
decoding of pertinent binding sequences. 
At first glance, this methodology seems 
very appealing; however, a glaring weak- 
ness inherent in this technology and other 
"genetic 11 library methodologies is that 
the overall diversity of the chemical units 
that can be applied within these systems 
is finite. Exploitation of these libraries 
for drug discovery may thus be limited. 



There are a number of advantages in 
using this recursive deconvolution strat- 
egy. (0 Split synthesis, a rather cumber- 
some process, need only be done once for 
each combinatorial library made. In stark 
contrast is the dual-defined method, 
which requires numerous split synthetic 
operations, (if) The deconvolution pro- 
cess recursively defines the synthesis of 
the active component, so that in the last 
cycle, the active compound is synthe- 
sized. In addition, this methodology al- 
lows the deduction of alternative active 
members, as each deconvolution pathway 
can be followed either in parallel or suc- 
cessively, (m) Any chemistry is applicable 
with this technology, which, as wc will 
see, can be problematic in generation of 
encoded combinatorial libraries. 

Tagged Methodologies to Generating 
Combinatorial Chemical Libraries 

Evident from the discussion presented is 
that there are three critical aspects in any 
combinatorial library: (») The chemical 
units that go into the library, (//) the lech- 
niaue for ccneralinn the library, and (Hi) 
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peptldes-on-Plasmlds 

An alternative to phage technology, which 
also relies on a biological tack is a genetic 
combinatorial library approach termed 
"peptides-on-plasmids." What Schatz 
and his coworkers at AfTymax (38) have 
done is establish another efficient meth- 
odology for a physical connection be* 
tween a peptide and nucleic acid that 
encodes for it. In their procedure, a li- 
brary of peptides is constructed so that the 
genetic material encoding them is linked 
through the DNA-binding activity of the 
lac repressor protein. The random pep* 
tides are fused to the C terminus of the 
repressor by cloning degenerate oligonu- 
cleotides at the 3' of the repressor gene 
(lac I) present on a plasmid. This plasmid 
also has lac repressor-binding sites, so the 
fusions bind the same plasmid that en- 
codes them. Assay for pepttde-receptor 
binding in the Schatz approach is quite 
similar to the panning process in phage 
methodology; however, now cell lysis is 
used to liberate the peptide-lac-plasmid 
complexes that are screened repeatedly 
for affinity enrichment. 

A proof of principle of this work came 
from the construction of a random dodec- 
amer library. This library was used to 
probe for potential peptide members that 
could bind to IgG D32.39, a specific hy* 
bridoma to the dynorphin B epitope. As 
expected, a consensus sequence was dis- 
covered that corresponded to similar 
binding studies of phage libraries (35). 
More recently, this peptide s-on-plasmid 
approach has been extended to a more 
challenging system — the discovery of new 
substrates for E. coll biotin haloenzyrne 
synthetase. The results were that smaller 
peptide units (13-residue sequences) 
could be used as substrates (39). 

Two features distinguish the peptides- 
on-plasmid approach from phage tech* 
nology. (0 Random peptides are dis- 
played with a free C terminus, (tf) The 
repressor fusions are cytoplasmic, 
whereas phage fusions are periplasmic. 
However, as with phage technology this 
methodology is restricted in its-repertoire 
of chemical building blocks to oligopep- 
tide units. 

Peptide Coded Libraries 

In an attempt to address the diversity 
problem inherent in genetic libraries while 
still retaining the advantages of a tagging 
unit, the Chiron Corporation group de- 
vised a chemical approach with peptides 
as the encoding unit (40). The chemical 
combinatorial strategy uses resin-splitting 
peptide synthesis to alternately synthe- 
size a "binding" strand and a "coding" 
strand. An orthogonal protecting-group 
scheme is also used to allow for the par- 
allel synthesis of both chemical units on 
the resin shown in Fig. 5. 



The isolation of receptor-binding ii- 
gands from a tagged library of this type 
can be done by affinity-selection or bead- 
staining techniques. The identification of 
the binding sequence can be determined 
by Ed man sequencing. However, when 
using such sequencing technology, the 
binding strand, if of peptide composition, 
must be made nonsequenceable. In an 
analogous fashion, the Selective Corpo- 
ration group (41) has also described a 
peptidic coding approach. In their report, 
protecting-group schemes are used that 
allow an assay for library receptor bind- 
ing, either on the matrix or in a soluble 
format. 

The most desirable feature of this type 
of peptide encoding strategy is that it 
grants a potential for alternative chemical 
units other than the natural amino acids 
(or nucleotides) to be incorporated into 
the binding strand. However, this meth- 
odology does not allow for enrichment by 
serial selection, the cornerstone upon 
which genetic library selection methods 
are founded. 

Electrophortc Polyhalobenzene Coded 
Libraries 

Another solution to the chemical coding 
of combinatorial libraries has been dis- 
closed recently by Still and coworkers 
(42). In their approach, a combinatorial 
peptide library attached to beads was as- 
sembled using the split synthesis method, 
while simultaneously being indexed by 
what they refer to as electrophone tag- 
ging. In their scheme, a series of aromatic 
electrophores varying in hydrocarbon 
chain, length (Fig* 6) are used as tagging 
units. Tagging is done by an alternating 
synthetic process that coincides with the 
addition of each library binding unit; how- 
ever, the tagging process here does not 
require sequential connection. To further 
simplify the entire scheme, the tags are 
used in a binary code to record the addi- 
tion of each building block and, thus, the 
reaction history of each bead. 

Screening of the library is done by a 
reporter-group assay (vide infra), and 



identified beads with affinity to the re- 
ceptor are individually picked out by a 
microptpetter. Use of the o-nitrobenzyl- 
carbonate moiety on the linker portion of 
each coding unit allows release of tag 
alcohols through photolysis, which can 
be analyzed by electron capture gas chro- 
matography and identified by the binary 
synthesis code. Borchardt and Still (43) 
have advanced this technology to the 
synthesis of a combinatorial N-acylated 
tripeptide library using both d- and 
L-amino acids. This library was used to 
probe the binding requirements of a syn- 
thetic receptor. The outcome of this 
study was that it allowed new types of 
host-guest interactions to be observed 
that might not have been discovered by 
conventional studies. 

A clear advantage of the Still coding 
technology is that numerous types of 
chemical processes are amenable to it. 
Thus, the potential to create highly di- 
versified chemical libraries is well within 
its purview. The tagging methodology is 
elegant, as no cosynthesis is required. 
Again, however, this methodology, like 
the peptide-tagging technology, does not 
allow for amplification or enrichment to 
operate. 

Encoded Combinatorial Libraries 

If the breadth and versatility of chemical 
synthesis could be linked to the power of 
genetics, a potentially more diverse ap- 
proach to tagged combinatorial libraries 
could be achieved. A conceptual method 
to bridge the gap between these two dif- 
ferent realms and apply it to combinatorial 
libraries was advanced by Brenner and 
Lerner (44) and termed encoded combi- 
natorial chemistry. In their theory, to 
carry out encoded combinatorial chemis- 
try (Fig. 7) two alternating parallel com- 
binatorial syntheses must be done so that 
a genetic tag is chemically linked to the 
chemical structure being synthesized. 
Thus, addition of a chemical unit to a 
matrix is followed by addition of an oli- 
gonucleotide sequence that codes the 
chemical unit appended. As with all chem- 
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for alternating synthesis of the binding and coding strands in peptide coded libraries. FMOC, 
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ical lagging technologies, split synthesis is 
applied in making the library. However, 
where this encoded approach differs from 
other coded synthetic methodologies is in 
the selection process between library 
members and receptors. Now active li- 
brary molecules are affinity-selected to a 
receptor, and amplified copies of their 
retrogenetic tags are obtained via PCR. 
DNA strands that are amplified can be 
used to enrich for a subset of the library by 
hybridization with the matching tags, and 
this type of panning process can be re- 
peated with this subset. Thus, serial en- 
richment js achieved by a process of pu- 
rification. Ultimately, the DNA of binding 
members can be decoded to provide the 
chemical history and, hence, the structure 
of the binding unit. 



The general principles of this technol- 
ogy are straightforward. The chemical 
manipulations to create such libraries can 
be complex. If one just considers synthe- 
sizing a chemical library of peptides, new 
linker technologies, protecting-group 
schemes, and, moreover, synthetic pro- 
tocols must be contemplated and devised. 
At the time when this encoded combina- 
torial chemical library approach was dis- 
closed, the alternating parallel synthesis 
of peptides and oligonucleotides had yet 
to be described. My group recently dem- 
onstrated the synthetic methods needed 
for implementation of this type of mixed 
chemical synthesis (45, 46). The gist of our 
tack, was to jnake.solid-phase peptide *nd 
oligonucleotide synthesis compatible. We 
accomplished this through the use of 9-flu- 
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orenylmethyloxycarbonyl (Fmoc) chem- 
istry for the peptide portion of the mole- 
cule, whereas a methyl phosphate scheme 
was engaged for the oligonucleotide- 
encoding unit. Following our lead, similar 
synthetic methodologies applied to beads 
have also been published by the group at 
Affymax (47). In their program, an oligo- 
nucleotides ncoded d, l peptide library 
was made on a 10-/*m support. Screening 
the library for binding to an epitope on> 
dynorphin B provided a number of pep- 
tide members after amplification and de- 
coding. 

The fervor that encoded combinatorial 
libraries brings to the chemical library 
field is evident, as it exploits the best of 
both worlds— chemical diversity and the 
power of genetics. The main constraints 
in using this technology come from its 
strength—namely, the tagging unit. Oli- 
gonucleotides can be chemically labile 
and incompatible with certain synthetic 
procedures. However, even with these 
limitations, the elegance of this approach 
is hard to surpass. 



Concluding Remarks and Future 
Directions 

Within a very short time span, a large 
body of work has accumulated within the 
field of combinatorial chemical libraries. 
As with any new scientific endeavor! 
different paths of research are explored 
in trying to advance the subject. I have 
presented what has come to be a philo- 
sophical divide within the scientific com- 
munity on combinatorial libraries: the 
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notion of tagging the combinatorial enti- 
ties or leaving them as untagged compo- 
nents. Clearly, strong arguments can be 
made in defense of either technology, but 
ultimately the deciding factor as to what 
methodology will be engaged depends on 
user need. Addressable libraries or ones 
procured by split synthesis for an itera- 
tive or recursive deconvolution approach 
iave the potential to create the most 
diverse combinatorial libraries. In con- 
trast, coded libraries, specifically those 
of genetic origin, allow for the screening 
of receptors that are rare and/or of lim- 
ited concentration, 

Currently, most available combinato- 
rial libraries (tagged or untagged) are of 
either peptide or nucleotide origin. It 
appears that the next wave of combina- 
torial research will be directed at the 
.Jesign of libraries that are devoid of the 
repetitive backbone linkage found within 
peptides or nucleotides. It is here that 
stroctural/stereoelectronic variation and 
unconstrained motifs will be allowed to 
expand to unparalleled combinatorial 
chemical diversity. Already important 
advances in this area by Bunin and Ell- 
man (43) and Hobbs Dewitt et al (49) 
have provided us with a glimpse of pos- 
sible developments. 

Finally, it should be noted that combi- 
natorial chemical libraries can provide us 
with another paradigm for drug discovery 
and development: Although the field is 
still young, the methods/technologies for 
generating and screening these libraries 
is already quite diverse and is becoming 
increasingly more sophisticated. The 
quest for the future is to find whether 
combinatorial chemical libraries can pro- 
vide us with more than just "lead** 
sources for drug discovery. As long as 
the essence of drug discovery is to find 
quick and cost-effective new drug candi- 
dates, combinatorial chemical libraries 
provide a unique and ever changing 
source, of chemical diversity. 
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Introduction, Aspartic acid proteases are a widely 
distributed family of enzymes that play important roles 
in fungi, plants, vertebrates, and retroviruses. 1 The 
aspartic acid proteases (characterized by having two 
aspartic acid residues in the active site) catalyze the 
hydrolysis of amide bonds with specificity for peptide 
bonds located between large hydrophobic residues. A 
number of aspartic acid proteases are important phar- 
maceutical targets, including renin, 2 cathepsin D, 3 the 
human immunodeficiency virus (HIV) protease, 4 * 5 hu- 
man t-cell leukemia virus type 1 (HTLV-1) protease, 6 
and Candida albicans aspartic acid protease. 7 

Potent inhibitors of these enzymes can be readily 
accessed by the incorporation of an isostere that mimics 
the geometry of the tetrahedral intermediate in place 
of the scissile bond of the peptide substrate. 2 *' 8 Unfor- 
tunately, these inhibitors have limited therapeutic 
utility, due to the poor oral availability and/or short- 
circulating half-lives that result from their peptidic 
nature. For this reason, there has been a great deal of 
work toward the development of aspartic acid protease 
inhibitors that display nonpeptide functionality about 
the isostere of the tetrahedral intermediate. 2 * 4 In order 
to generate a therapeutically useful inhibitor the com- 
pound must have both high affinity and favorable 
pharmokinetic properties. The combination of these two 
requirements render an a priori design of inhibitor 
structure based on the peptide substrate very challeng- 
ing. Therefore, the identification of potent and bioavail- 
able inhibitors has required the time-consuming syn- 
thesis and evaluation of a large number of different 
nonpeptidic compounds. 

Herein we report a general and high-yielding solid- 
phase method for the rapid display of nonpeptide 
functionality about molecules incorporating the (hy- 
droxyethyDamine and (hydroxyethyDurea isosteres. 9 Of 
the possible isosteres upon which to construct a library 
of potential nonpeptide inhibitors, the (hydroxyethyl)- 
amine and (hydroxyethyl)urea isosteres were selected 
for two reasons. First, several orally available HIV-1 
protease inhibitors that incorporate these isosteres have 
been identified (Figure l), 4 including compounds that 
are currently in clinical trials for the treatment of HIV 
infection. Second, we believed that solid-phase methods 
could be developed to display a wide range of diverse 
functionality about these isosteres. 

Initially we chose to display functionality from scaf- 
fold l, 10 which provides access to known HIV-1 protease 
inhibitors. The scaffold was first coupled to dihydro- 
pyran functionalized polystyrene support by employing 
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pyridinium p-toluenesulfonate in 1,2-dichloroethane 
(Scheme l). u The reaction progress was qualitatively 
monitored by IR by following the appearance of the 
azide stretch. The exact loading level of the resin was 
based on the mass balance of recovered alcohol 1, which 
was obtained by subjecting a portion of the resin 2 to 
cleavage by 95:5 trifluoroacetic acid (TFAYwater. 12 

The synthesis was initiated by displacement of the 
primary tosyl alcohol with either ftinctionalized or 
unfunctionalized primary or secondary amines, includ- 
ing amines found in known HIV-1 protease inhibitors 
(Scheme 2, 8a-h and 41). After coupling of the primary 
amines, the resulting secondary amine products 3a- g 
can be converted to ureas by reaction with isocyanates 
(4a -f) or by stepwise treatment with triphosgene fol- 
lowed by amine addition (4g). 13,14 The ability to employ 
either a preformed isocyanate or the stepwise procedure 
to synthesize the ureas provides ready incorporation of 
functionality from both commercially available isocy- 
anates and the even larger pool of commercially avail- 
able amines. Notably, substituted ureas have been 
employed successfully at this site in a number of 
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Figure 2, (Hydroxyethyl)amine and (hydroxyethyl)urea derivatives synthesized on solid support. Yields of analytically pure 
material after chromatography were determined from mass balance and were based upon the initial loading of alcohol 1. Elemental 
analyses were all within ±0.4% of theoretical value, 



In summary, we have been able to obtain good yields 
(47-86%) of molecules incorporating the (hydroxyethyl)- 
amine and (hydroxyethyl)urea igosteres after four to six 
chemical transformations on solid support. In addition, 
we have been able to incorporate many of the functional 
groups and structures that are present in known inhibi- 
tors of HIV- 1 protease and renin, thereby demonstrating 
the generality of the synthesis sequence. Employing the 
described synthesis method, the simultaneous synthesis 
of a library of potential aspartic acid protease inhibitors 
is in progress, as is the evaluation of the library against 
a number of aspartic acid protease targets. 22 
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REDUCTIVE ALKYLATION ON A SOLID PHASE: SYNTHESIS OF A 
PIPERAZINEDIONE COMBINATORIAL LIBRARY 

David W. Gordon and John Steele* 

Discovery Chemistry Department > Pfizer Central Research, Sandwich, Kent CT13 9NJ, UK 



Abstract: The synthesis of a prototype trisubstituted piperazinedione combinatorial library of 1,000 
compounds has been achieved from three precursor sets - two sets of ten a-amino acids and one set of ten 
aldehydes, A sodium triacetoxyborohydride-mediated reductive alkylation was crucial to the success of the 
multi-step synthesis on resin. This protocol represents a new method to augment compound files rapidly with 
novel heterocyclic endues for high-speed screening. 

Combinatorial synthesis of libraries containing linear peptide and non-peptide structures is widely recognised as 
a valuable new tool for lead discovery in the search for new therapeutics. Of the methods for achieving the 
synthesis of large numbers of library components, one of the most flexible is the resin-based mix and split 
technology, first described by Furka and co-workers in 1988 1 and subsequently developed further by several 
other groups. 2,3 The speed and convenience of screening compound mixtures is a powerful aspect of this 
technology. In general, this technique has been restricted to peptide- type coupling reactions, 2 although solid- 
phase chemistry for the assembly of some heterocycles as individual compounds is now becoming established. 4 
We describe here the solid-phase synthesis of a prototype combinatorial library of 1,000 piperazinediones 
(diketopiperazines or DKPs) 5, each containing three centres of molecular diversity. The rigid DKP template 
was selected as a versatile* heterocyclic scaffold on which to array pendant functionality, 
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Scheme J contains the synthetic protocol for a representative DKP derived from Fmoc-Phe-Wang resin 1 , 
Boc-Ala and p-methoxybenzaldehyde. 
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The reductive alkylation of peptides on MBH 5 and Merrifield 6 resins using sodium cyanoborohydride in 
acidified DMP is a well-established process but pH sensitivity and potential reagent toxicity made this an 
unattractive option for the present work. We have instead used an efficient sodium triacetoxyborohydride- 
facilitated alkylation of amino acids such as 2 on Wang resin by a range of aldehydes, affording an array of 
novel, resin-bound secondary amines including 3. Yields for the general process 2-*3 are typically 85-95%, 
although partial racemisation of some homochiral amino acids (notably Phe as shown) was observed. This 
reductive alkylation has proved compatible with a wide range of natural and synthetic a-amino acids and is 
tolerant of almost any aldehyde, although in a potential 'worst case' combination of a hindered amino acid (e.g. 
Val, see later) and an electronically deactivated aldehyde, isolated yields of -20% were obtained, the balance of 
recovered material being the unreacted amino acid. Aliphatic aldehydes also gave minor but significant amounts 
of bis alkylation (1-10%), generating tertiary amines. Coupling of a relatively unreactive secondary amine (eg 
3) to a further Boc-protected amino acid proved to be a greater chemical challenge. After examination of a range 
of activated ester coupling techniques, this transformation was best achieved in our hands using a PyBrOP 
mediated double-coupling protocol, generating an amide (eg 4). A double coupling was found to be essential to 
achieve consistent yields of >90%. The optimal method of assembling target DKPs such as 5 employed a TFA- 
mediated Boc deprotection and concomitant resin cleavage followed by a short reflux of the evaporated filtrate in 
toluene to induce cyclisation. Without the thermolysis in toluene, no DKP formation was observed in any of our 
work. In the specific case given in Scheme l t the overall yield for synthesis of the pure DKP 5 was 42% 
based on resin-bound amino acid with trace amounts of bis-alkylated Phe recovered as the only identifiable by* 
product. Chiral HPLC analysis 7 of a sample of the reductive alkylation product 3 after resin cleavage indicated 
slight racemisation, presumably occurring prior to reduction of the intermediate imine. 



The monomeric components used for the prototype piperazinedione library are given in the Tables below: 
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Reductive alkylation on a solid phase 
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To assemble a prototype combinatorial library of 1,000 piperazinediones, the protocol in Scheme 2 was used: 8 
floAMINO ACIDS ON RESIN | 



Scheme 2: 
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Ten Fmoc-protected amino acids 
6 on Wang resin were intimately 
mixed and then distributed as 
equivalent mixtures into ten 
reactor vessels. Each mixture 
was deprotected using 
piperidine and reductively 
alkylated with a single aldehyde. 
Samples of the resultant 
secondary amines on resin 7 
(-5% by wt) were retained for 
analysis and subsequent 
synthetic follow-up of any 
biologically active mixtures. The 
remaining resins were mixed 
again and redistributed into ten 
further mixtures. Each mixture 
was coupled to a single amino 
acid, then die. resultant products 
8 were cleaved and cyclised as 
described earlier for the single 
DKP synthesis to afford ten 
mixtures, each containing 100 
DKPs 9 - overall constructing a 
total of 1.000 compounds 
from an Initial pool of 27 

contributing mutts. 



The comprehensive analysis of multicomponent mixtures presents a significant problem. Clearly, some of the 
1,000 components may be under-represented as a consequence of two or more consecutive low yields. The aim 
of our analysis has been to fully validate the chemistry, then characterise intermediate stages in the library 
synthesis to generate confidence that most (>95%) of the expected components are likely to be present. For the 
DKP library, this was achieved by assembly and full characterisation of a range of individual, pure DKPs 
including representative syntheses of potential * worst-case' products such as 9 (RWpr, R 3 = l Bu, R 2 =3,4,5- 
Lrimethoxyphenyl - 24% overall). For the intermediate ten-component mixtures of secondary amino acids 
(prepared by TFA digestion of 7), HPLC-MS combined with MS-MS was used to unambiguously confirm the 
presence of 96 of the expected 100 components (ref 9 contains die experimental protocol used for this analysis). 
Spectroscopic methods are still underdevelopment for the effective analysis of 100-component mixtures. 
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